WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences
Similar resources
WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences.
We present here a fast and sensitive method designed to isolate short nucleotide sequences which have non-random statistical properties and may thus be biologically active. It is based on a first order Markov analysis and allows us to detect statistically significant sequence motifs from six to ten nucleotides long which are significantly shared (or avoided) in the sequences under investigation...
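The first-order Markov idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the WORDUP implementation: the function names (`markov1_expected`, `observed_count`, `z_score`) are hypothetical, and the z-score used to compare observed and expected counts is an assumption made for illustration, since the abstract excerpt does not state which statistic the authors use. Under a first-order Markov model, the probability of a word is the frequency of its first base multiplied by the transition frequencies between consecutive bases, both estimated from the sequence itself.

```python
from collections import Counter
from math import sqrt


def markov1_expected(seq, word):
    """Expected count of `word` in `seq` under a first-order Markov model.

    Transition probabilities are estimated from dinucleotide counts in `seq`.
    """
    origins = Counter(seq[:-1])  # how often each base starts a dinucleotide
    pairs = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    prob = seq.count(word[0]) / len(seq)  # frequency of the first base
    for a, b in zip(word, word[1:]):
        if origins[a] == 0:
            return 0.0  # base never seen as a transition origin
        prob *= pairs[a + b] / origins[a]
    return prob * (len(seq) - len(word) + 1)


def observed_count(seq, word):
    """Number of (possibly overlapping) occurrences of `word` in `seq`."""
    last_start = len(seq) - len(word)
    return sum(1 for i in range(last_start + 1) if seq.startswith(word, i))


def z_score(seq, word):
    """Illustrative over/under-representation score (assumed statistic)."""
    expected = markov1_expected(seq, word)
    if expected <= 0:
        return 0.0
    return (observed_count(seq, word) - expected) / sqrt(expected)


if __name__ == "__main__":
    sequence = "ACGT" * 50
    for w in ("ACGTAC", "AAAAAA"):
        print(w, observed_count(sequence, w), markov1_expected(sequence, w))
```

A strongly positive score would suggest a word shared more often than the dinucleotide composition predicts, and a strongly negative one a word that is avoided. A real analysis over many sequences would also need a multiple-testing correction, which this sketch omits.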
Identifying DNA and protein patterns with statistically significant alignments of multiple sequences
Motivation: Molecular biologists can frequently obtain interesting insights by aligning a set of related DNA, RNA or protein sequences. Such alignments can be used to determine either evolutionary or functional relationships. Our interest is in identifying functional relationships. Unless the sequences are very similar, it is necessary to have a specific strategy for measuring, or scoring, the rela...
Development of an Efficient Hybrid Method for Motif Discovery in DNA Sequences
This work presents a hybrid method for motif discovery in DNA sequences. The proposed method, called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...
An Efficient Procedure for Mining Statistically Significant Frequent Itemsets
We propose an original procedure for frequent itemset generation that is more efficient than the corresponding procedure of the well-known Apriori algorithm. The correctness of the procedure is based on a special structure called a Rymon tree. For its implementation, we suggest a modified sort-merge-join algorithm. Finally, we explain how the support measure, which is used in the Apriori algorithm,...
An Efficient Algorithm for Small Gene Prediction in DNA Sequences
ISSN: 2186-1390; http://cennser.org/IJCVSP. The main purpose of this paper is to introduce a new method for gene prediction in DNA sequences based on the period-3 property of exons. First, the symbolic DNA sequences are converted to a digital signal using the maximum homogeny estimation modeling method. Then, to reduce the effect of background noise in the period-3 spectrum, we have used the di...
Journal
Journal title: Nucleic Acids Research
Year: 1992
ISSN: 0305-1048,1362-4962
DOI: 10.1093/nar/20.11.2871